The WASABI Dataset: Cultural, Lyrics and Audio Analysis Metadata About 2 Million Popular Commercially Released Songs


Abstract

Since 2017, the goal of the two-million-song WASABI database has been to build a knowledge graph linking collected metadata (artists, discography, producers, dates, etc.) with metadata generated by the analysis of both the songs' lyrics (topics, places, emotions, structure, etc.) and audio signal (chords, sound, etc.). It relies on natural language processing and machine learning methods for extraction, and on semantic Web frameworks for representation and integration. It describes more than 2 million commercial songs, 200K albums and 77K artists. It can be exploited by music search engines, music professionals (e.g. journalists, radio presenters, music teachers) or scientists willing to analyze popular music published since 1950. It is available under an open license, in multiple formats, and with online and open source services including an interactive navigator, a REST API and a SPARQL endpoint.


Similar resources

All About Audio Metadata

Metadata, the “data about the audio data” that travels along with the multichannel audio bitstream in Dolby Digital, makes life easier for broadcasters while also increasing the creative ability of audio mixers. For broadcasters, audio metadata now means they have a set-and-forget solution, instead of monitoring, compressing, and adjusting levels all over the plant. For audio mixers, this means...


Genre Classification of Spotify Songs using Lyrics, Audio Previews, and Album Artwork

This paper is an attempt to attack the problem of genre classification of music from a variety of angles. Three different types of data (song previews, album artwork, and lyrics) are used to train three models (a Recurrent Neural Network, k-Nearest Neighbors, and Naive Bayes, respectively) and the outputs of the three are again combined to classify a given song. The combined model was able to a...


The Million Song Dataset

We introduce the Million Song Dataset, a freely-available collection of audio features and metadata for a million contemporary popular music tracks. We describe its creation process, its content, and its possible uses. Attractive features of the Million Song Database include the range of existing resources to which it is linked, and the fact that it is the largest current research dataset in ou...


A Preliminary Study on a Recommender System for the Million Songs Dataset Challenge

In this paper the preliminary study we are conducting on the Million Songs Dataset (MSD) challenge is described. The task of the competition is to suggest a set of songs to a user given half of its listening history and complete listening history of other 1 million people. We focus on memory-based collaborative filtering approaches since they are able to deal with large datasets in an efficient...


Identifying singers of popular songs

In this paper, we propose to identify the singers of popular songs using vibrato characteristics and high level musical knowledge of song structure. The proposed framework starts with a vocal detection process followed by a hypothesis test for the vocal/non-vocal verification. This method allows us to select vocal segments of high confidence for singer identification. From the selected vocal se...



Journal

Journal title: Lecture Notes in Computer Science

Year: 2021

ISSN: 1611-3349, 0302-9743

DOI: https://doi.org/10.1007/978-3-030-77385-4_31